488 research outputs found
An equivalent formulation for the Shapley value
The final authenticated version is available online at: https://doi.org/10.1007/978-3-662-58464-4_1.An equivalent explicit formula for the Shapley value is provided, its equivalence with the classical one is proven by double induction. The importance of this new formula, in contrast to the classical one, is its capability of being extended to more general classes of games, in particular to j-cooperative games or multichoice games, in which players choose among different levels of participation in the game.Peer ReviewedPostprint (published version
Matching structure and bargaining outcomes in buyer–seller networks
We examine the relationship between the matching structure of a bipartite (buyer-seller) network and the (expected) shares of the unit surplus that each connected pair in this network can create. We show that in different bargaining environments, these shares are closely related to the Gallai-Edmonds Structure Theorem. This theorem characterizes the structure of maximum matchings in an undirected graph. We show that the relationship between the (expected) shares and the tructure Theorem is not an artefact of a particular bargaining mechanism or trade centralization. However, this relationship does not necessarily generalize to non-bipartite networks or to networks with heterogeneous link values
Attacking the V:On the resiliency of adaptive-horizon MPC
Inspired by the emerging problem of CPS security, we introduce the concept of controller-attacker games. A controller-attacker game is a two-player stochastic game, where the two players, a controller and an attacker, have antagonistic objectives. A controller-attacker game is formulated in terms of a Markov Decision Process (MDP), with the controller and the attacker jointly determining the MDP’s transition probabilities. We also introduce the class of controller-attacker games we call V-formation games, where the goal of the controller is to maneuver the plant (a simple model of flocking dynamics) into a V-formation, and the goal of the attacker is to prevent the controller from doing so. Controllers in V-formation games utilize a new formulation of model-predictive control we have developed called Adaptive-Horizon MPC (AMPC), giving them extraordinary power: we prove that under certain controllability conditions, an AMPC controller can attain V-formation with probability 1. We evaluate AMPC’s performance on V-formation games using statistical model checking. Our experiments demonstrate that (a) as we increase the power of the attacker, the AMPC controller adapts by suitably increasing its horizon, and thus demonstrates resiliency to a variety of attacks; and (b) an intelligent attacker can significantly outperform its naive counterpart
The Impatient May Use Limited Optimism to Minimize Regret
Discounted-sum games provide a formal model for the study of reinforcement
learning, where the agent is enticed to get rewards early since later rewards
are discounted. When the agent interacts with the environment, she may regret
her actions, realizing that a previous choice was suboptimal given the behavior
of the environment. The main contribution of this paper is a PSPACE algorithm
for computing the minimum possible regret of a given game. To this end, several
results of independent interest are shown. (1) We identify a class of
regret-minimizing and admissible strategies that first assume that the
environment is collaborating, then assume it is adversarial---the precise
timing of the switch is key here. (2) Disregarding the computational cost of
numerical analysis, we provide an NP algorithm that checks that the regret
entailed by a given time-switching strategy exceeds a given value. (3) We show
that determining whether a strategy minimizes regret is decidable in PSPACE
Efficiency in Multi-objective Games
In a multi-objective game, each agent individually evaluates each overall
action-profile on multiple objectives. I generalize the price of anarchy to
multi-objective games and provide a polynomial-time algorithm to assess it.
This work asserts that policies on tobacco promote a higher economic
efficiency
Group Strategyproof Pareto-Stable Marriage with Indifferences via the Generalized Assignment Game
We study the variant of the stable marriage problem in which the preferences
of the agents are allowed to include indifferences. We present a mechanism for
producing Pareto-stable matchings in stable marriage markets with indifferences
that is group strategyproof for one side of the market. Our key technique
involves modeling the stable marriage market as a generalized assignment game.
We also show that our mechanism can be implemented efficiently. These results
can be extended to the college admissions problem with indifferences
Outer Approximations of Coherent Lower Probabilities Using Belief Functions
We investigate the problem of outer approximating a coherent lower probability with a more tractable model. In particular, in this work we focus on the outer approximations made by belief functions. We show that they can be obtained by solving a linear programming problem. In addition, we consider the subfamily of necessity measures, and show that in that case we can determine all the undominated outer approximations in a simple manner
- …